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METHOD OF IDENTIFYING THE FUNCTION OF A TEST AGENT 



RELATED APPLICATIONS 



V^-> Thi s application claims priority to USSN 60/177,416, filed January 21, 2000. The 

contents of this application are incorporated herein by reference in their entirety. 

5 

FIELD OF THE INVENTION 

The invention relates to biochemistry, molecular biology, and cell biology. 



BACKGROUND OF THE INVENTION 

The accumulation of raw nucleic acid sequence information for various organisms, 
coupled with the development of methods for identifying open reading frames encoding 
candidate proteins, is creating a need for methods that determine the function of previously 
unknown proteins. To date, functions of unknown proteins can be inferred by identifying genes 
whose expression changes (by increasing or decreasing) in the presence of the agent protein. 
However, such gene expression assays can be costly and labor-intensive. An effective and 
economical method for screening novel proteins for functions of interest is needed in the art. 

SUMMARY OF THE INVENTION 

The invention is based in part on the discovery of a system and method for rapidly and 
20 economically identifying the function of a test agent, such as a polypeptide, by examining 
changes in expression of genes in a plurality of cells contacted with the test agent. 

In one aspect, the invention includes a method of identifying the function of a test 
compound by contacting a plurality of cells with a test compound. The plurality includes at least 
a first cell and a second cell of a different type than the first cell type. Expression of one or more 
25 genes in cells or the plurality is measured. An alteration in the expression of the genes relative to 
the expression genes in a reference cell reveals the function of the test compound. For example, 
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if the test compound is a polypeptide and induces a gene expression pattern characteristic of a 
cytokine, the test compound is considered a candidate new cytokine. 

Preferably, the plurality includes three, four, five, six, or ten or more distinct cell types. 
Preferably, the expression of multiple genes, e.g., at least two, three, four, five, seven, and even 
5 ten genes is measured in one or more of the distinct cells in the array. 

For example, the method can include measuring the expression of at least two genes (and 
more preferably at least five genes) in the first cell, and, optionally measuring the expression of 
at least two genes (and more preferably at least five genes) in the second cell In a preferred 
embodiment, expression of one or more genes is also measured in a third cell, wherein the third 
10 cell is a different cell type from the first cell and the second cell. In a more preferred 
p% embodiment, expression of one or more genes is also measured in a fourth cell, wherein the 
^ fourth cell is a different cell type from the first cell, the second cell type, and the third cell type, 
CW Expression of a gene or genes in a cell exposed to a test agent can be compared to 

Si expression of the gene in a reference cell (e.g., otherwise identical cells not exposed to the test 
:i'5 agent). The reference cell may be processed in parallel to cells in the plurality; alternatively, 
% expression information for the reference cell can be stored in a database. 

fjS The plurality of cells is preferably provided in a container in which different cell types in 

^2 the plurality are spatially segregated. A preferred container is one in which the test agent can be 
13 added to the cells, after which the cells are lysed for isolating RNA. The container may in 
"20 addition include control cells, e.g., cells not exposed to a test agent. 

Examples of suitable test compounds include small molecules (typically molecules with 
molecular weights less than 1000 kDa) or larger macromolecules such as polynucleotides 
(including ribozymes) and polypeptides. Suitable polypeptides can also include antibodies. In 
some embodiments, two or more test compounds are added to the plurality of cells. 
25 While any kind of cell can be used in the method, preferred cells are mammalian (e.g., 

human) cells. Cells can be from established cell lines, or can be primary cells. Cell lines used in 
the method are preferably derived from multiple tissue types. Cell lines may be growth factor 
dependent or growth factor independent. Test compounds may be added in the presence or 
absence of serum. Cell lines may be derived from tissues of different species, but are preferably 
30 mammalian cells. Most preferably, the cells are derived from human cells. The cell can be 
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derived from a human tissue, i.e., a primary cell, or can be from an established {e.g., 
immortalized) cell line. 

Examples of cells suitable for use in the invention include MG-63 cells, U87-MG cells, 
TF-1 cells, HepG2 cells, THP-1 cells, HUVEC cells, CCD-1070SK cells, and Jurkat E6-1 cells. 
In some embodiments, a cell line of the invention is associated with a clinical indication, 
disorder or disease. 

Any method known in the art can be used to measure gene expression. A preferred 
method is polymerase chain reaction, e.g., real-time polymerase chain reaction. 

Also provided by the invention is a method of identifying the function of a test 
polypeptide by contacting a plurality of cells with the test polypeptide. The plurality includes a 
first mammalian cell, a second mammalian cell, and a third mammalian cell, wherein the first 
cell is a different cell type from the second cell type, the second cell type is a different cell type 
from the third cell type, and the third cell type is a different cell type from the first cell type. 
Expression of three or more genes is measured in the first cell, second cell, and third cell. An 
alteration in the level of expression of the gene relative to the expression of the genes in a 
reference cell indicates the function of the test compound. Expression is preferably measured 
using a polymerase chain reaction, e.g., a real-time polymerase chain reaction. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although methods and materials similar or equivalent to those described herein can be 
used in the practice or testing of the invention, suitable methods and materials are described 
below. All publications, patent applications, patents, and other references mentioned herein are 
incorporated by reference in their entirety. In the case of conflict, the present specification, 
including definitions, will control. In addition, the materials, methods, and examples are 
illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION 

The invention provides a method for rapidly and economically identifying the function of 
a test agent of interest by adding the test agent to multiple cell lines, and measuring changes in 
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gene expression of a predetermined set of genes in each cell line. By identifying those genes 
whose expression changes in the presence of the test agent as compared to the expression of the 
gene in the absence of the agent, it is possible to make inferences about the function of the 
polypeptide. The screen can be performed prior to, or contemporaneous with, other cell-based 
5 assays. These assays include assays measuring cell growth (bromodeoxyuridine ("BrdU") 

incorporation or the colorimetric 3-(4 ? 5)-dimethylthiazol-2-yl)-2,5-diphenyl-tetrazolium bromide 
("MTT") metabolism assay). 

Cell lines and genes examined are preferably chosen so that information on changes in 
gene expression in a cell will provide insight into the function of the polypeptide. Examples of 
1 0 cells and corresponding genes suitable for use in the methods in the invention are described in 
_ Table 1 . Genes for which changes in expression are associated with biological functions and 
y3 relevant clinical indications are provided in Table 2. Examples of additional cells include, e.g., T 
^ Cells, monocytes, B Cells, NK Cells, normal human osteoblasts (NHOst), astrocytes, 
g hepatocytes, and normal human lung fibroblasts. Additional genes to test for induced changes in 
ft? expression are CD23, IFNy, TNFa, and GCSF. 

^ Screening is conveniently performed in a container in which it is possible to culture cells, 

O add the test agent, and lyse cells for RNA isolation. The container segregates different cell types 
H and can in addition include control cells (e.g., cells not exposed to agent). For cells whose 
g growth is serum-dependent, the container may additionally include cells exposed to a test agent 
SO but not serum. 

A preferred container is a 96-well plate. A single well of a 96-well plate generates 
sufficient RNA for at least 12 PCR tests, thus allowing for the probing of 1 1 diagnostic genes 
plus a negative control (where the negative control maybe, for example, GAPDH minus RT) per 
cell line. Additionally, expression of a reference gene can be monitored in each well and serve as 
25 an internal control or standard. An example of such a reference gene is GAPDH. PCR plate 

layouts and cell culture techniques are commonly known within the art. Cell lysates can then be 
transferred to a second container, if desired, in which RNA is isolated and further manipulations 
(such as PCR-based analyses) performed. 

Genes whose expression is to be measured are preferably chosen for each cell line to 
30 provide detection of a broad spectrum of desired biological activities, e.g. , a cytokine-like 

activity in multiple cell types. A test compound that regulates the expression of at least one gene 
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in at least one cell type by a factor considered to represent a significant change in the level of 
expression is chosen for further analysis. In one embodiment, the factor of significant change is 
at least ± 4-fold. 

The invention will be further illustrated in the following non-limiting examples. 

EXAMPLES 

Example 1: Procedure for assessing polypeptide-mediated changes in gene expression in a 
plurality of cell types. 

On Day 1, adherent cells are plated in a 96-well flat bottom dish in 100 ul growth medium 
(2xl0 4 to 3xl0 4 cells/well). On Day 2, adherent cells are washed with starvation medium and 
100 ul starvation medium is added. Starvation medium contains 0.1% FBS for factor- 
independent cell lines (e.g., MG-63, U87-MG, HepG2, CCD-1070SK), or 2% FBS minus growth 
factors for factor-dependent cell lines (e.g., HUVEC). Suspension cells are plated in a 96-well 
round bottom dish in 100 ul starvation medium (1x10 s cells/well). Starvation medium contains 
0.1% FBS for factor-independent cell lines (e.g., THP-1, Jurkat), and 10% FBS minus growth 
factors for factor-dependent cell lines (e.g., TF-1). All cells are incubated for 24 hours. 

On Day 3, test compounds are added to the cells. Typically, 10 ul/well of a 1 OX stock for 
known proteins can be added. Alternatively, 10 to 100 ul/well of undiluted conditioned media 
for novel proteins may be used. Cells are incubated for 6 hr at 37°C. Cytoplasmic RNA is 
prepared from cells by centrifuging round-bottom plates containing suspension cells and 
discarding the supernatant. Supernatant from the flat-bottom wells containing adherent cells is 
also aspirated and discarded. 

RLN lysis buffer is added to all sample wells. Plates are centrifuged, and the lysates 
(supernatants) are transferred to Uneasy columns (96 column plate). RNA is washed and eluted 
in 160 ul RNase free water according to the manufacturer's instructions. 

On Day 4, 5, and 8, up to three plates of RNA samples are processed for TaqMan™ 
expression analysis. A master mix is prepared for each well as follows: 

1 OX TaqMan buffer (provided by the manufacturer) 2.5 ju.1 

M gCl 2 25 mM stock 5.5 a] 
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dNTP 


2.5 mM- 5.0 mM stock 


3.0 |Ltl 




AmpliTaq Gold 


5U/ml 


0.125 ^1 




Multiscribe RT 


50 U/ml 


0.125 (^1 




RNAse inhibitor 




1.0 ^1 


5 


Forward primer 


GAPDH, 10 \jM stock 


0.5 yl 




Reverse primer 


GAPDH, 10 iiM stock 


0.5 |^1 




Probe * 


GAPDH, 5 nM stock 


0.5 JL4.1 




Forward primer 


gene, 45 |jM stock 


0.5 ill 




Reverse primer 


gene, 45 uM stock 


0.5 jn.1 


10 


Probe * 


gene, 22.5 jaM stock 


0.5 fj.1 




dH 2 0 




2.25 |xl 




Total 




17.50 |al 



U] The GAPDH or other selected reference probe is labeled according to a standard 

15 TaqMan™ protocol, e.g., 5' ends are labeled with JOE, 3' ends with TAMRA; while the gene- 
~2 specific probes are labeled with a compound that may be monitored independently of the 
M reference probe, e.g., 5' ends with FAM, 3' ends with TAMRA. 

g For the TaqMan™ analysis, 17.5 jutl per well of the master mix is added to 96 well PCR 

^ plates containing 7.5 jul RNA sample per well Reaction conditions include 2 minutes at 50°C, 
20 10 minutes at 95°C, and 40 cycles of: 1 minute at 95°C, 0.40 minutes at 58°C, 1 minute at 72°C. 

Amplification is monitored by measuring the release of the fluorescent JOE and FAM markers 

during the 72°C extension step. 

Data are analyzed by comparing expression of each gene to GAPDH. To calculate 

changes in gene expression, gene expression in control samples is calculated and compared to 
25 the equivalent gene expression levels in the test compound-stimulated samples. 

Table 1. Cell lines and gene lists for expression analysis. 
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Cell TJne. Tissue Tvne 


Gene List 




MG-63, Osteosarcoma 


IRF-1* 


IL-8* 


TAP-1* 


LOX* 






OPG 


Factor B* 


** 

Collagen 


Collagenase 






BMP-3 


MxA* 


PCNA 






U-87MG, Astrocytoma 


IRF-1*** 


IL-8* 


MCP-1* 


ICAM-1 






c-Kit 


HLA-DR 


iNOS 


Tenascin-c* 






c-Myc 


VEGF 


GDNF 






TF-1, Erythroleukemia 


IRF-1*** 


beta-globin 


EpoR 


ICAM-1**** 






c-Kit 


Factor B 


GpIIb 


c-Mpl 






GBP-2 


STAT-1 


PCNA 






HepG2, Hepatoma 


IRF-1 


Haptoglobin 


PEPCK 


IGFBP1 






c-Kit 


CYP4A1 


Factor X 


CYP7A 






HMGCoA Rd 


Hexokinase 


ApoC3 






THP-1, Monocytic 


IRF-1 


Egrl 


TAP-1 


ICAM-1 


0 




CCR2 


HLA-DR 


iNOS 


IL-12 






TGF-betal 


MnSOD 


IL-10 






HUVEC, Endothelial 


PECAM 


Egrl 


VCAM 


ICAM-1 






Tissue Factor 


COX-2 


eNOS 


Endothelin-1 






KDR 


IL-6 


MMP-2 




fp 


CCD-1070SK, Fibroblast 


c-Myc 


IL-8 


FGF-2 


FGF-7 






c-Kit 


COX-2 


Factor III 


Endothelin-1 






HMGCoA Rd 


Hexokinase 


PCNA 






Jurkat E6-1, T-cell 


IL-2 


IL-3 


IL-4 


IL-2R 






CD69 


COX-2 


NFAT 


Fas Ligand 






Bcl-2 


LFA-1 


PCNA 





Highlighted genes were confirmed by GeneCalling on the indicated cell lines: 
* = up in IL-la treatment 



*% _ U p j n OPG treatment, down in thrombopoietin treatment 
*** = up in IFNy treatment 
5 **** = up in IL-6 treatment 

Remaining genes were selected based on TaqMan results and literature surveys. 



Table 2. Functional classification of gene probes. 



Functional 


Clinical indications: 




Gene List 




classifications: 










Angiogenesis 


Cancer 


PECAM 


VCAM 


COX-2 


wound healing 


Surgical and burn wound 


Endothelin-1 


Tissue Factor 


eNOS 




healing 


KDR 


MMP-2 


IL-8 




Gastric ulceration 


FGF-2 


FGF-7 


VEGF 


Inflammation 


Rheumatoid arthritis 


IRF-1 


ICAM-1 


MCP-1 




Crohn's disease 


HLA-DR 


iNOS 


Factor B 




Multiple sclerosis 


GBP-2 


Haptoglobin 


TAP-1 






CCR2 


IL-12 


TGF-betal 






IL-10 


MnSOD 


IL-6 
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Metabolism 


Obesity 


CYP4A1 


IGFBPl 


PEPCK 




NIDDM 


CYP7A 


HMCjCoA Rd 


ApoCi 




Cholesterol disorders 


MxA 


Hexokinase 




Coagulation 


Thrombocytopenia 


Factor X 


Factor III 






Hemophilia 








T-cell 


Immune deficiency 


IL-2 


IL-3 


IL-4 


activation 


Cancer immunotherapy 


IL-2R 


NFAT 


CD69 




Autoimmunity 


LFA-1 






Bone 


Osteoporosis 


LOX 


OPG 


Collagen 


formation 


Bone fracture 


BMP-3 


Collagenase 






Growth disorders 








Growth factor 


Neurodegenerative disorders 


c-Kit 


c-Myc 


PCiNA 


Cell cycle 


Cancer 


Bcl-2 


T~} 1 

Egrl 


r%s Ligand 


Apoptosis 


Autoimmunity 


GDNF 


Tenascin-c 




Hematopoiesis 


Immune deficiency 


beta-globin 


EpoR 


GpIIb 


Erythropoiesis 


Thrombocytopenia 


c-Mpl 


STAT-1 






Anemia 









In Table 2, many genes are associated with multiple activities, but are only listed once. For 
example, IL-8 could be listed in Angiogenesis and Inflammation; LOX could be listed in Bone 
formation and Inflammation; and Fas Test compound could be listed in Apoptosis and T-cell 
activation. A total of 62 distinct genes are represented. 



EQUIVALENTS 

From the foregoing detailed description of the specific embodiments of the invention, it 
should be apparent that particular novel compositions and methods involving analysis of novel 
protein function have been described. Although particular embodiments have been disclosed 
herein in detail, this has been done by way of example for purposes of illustration only, and is 
not intended to be limiting with respect to the scope of the appended claims that follow. In 
particular, it is contemplated by the inventors that various substitutions, alterations, and 
modifications may be made as a matter of routine for a person of ordinary skill in the art to the 
invention without departing from the spirit and scope of the invention as defined by the claims. 
Indeed, various modifications of the invention in addition to those described herein will become 
apparent to those skilled in the art from the foregoing description. Such modifications are 
intended to fall within the scope of the appended claims. 
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